Feature fusion within local region using localized maximum-margin learning for scene categorization
نویسندگان
چکیده
In the field of visual recognition such as scene categorization, representing an image based on the local feature (e.g. the bags of visual words (BOW) model and its variants) has become popular and one of the most successful methods. In this paper, we propose a method that uses localized maximum-margin learning to fuse different types of features during the BOW modeling process for eventual scene classification. Unlike previous feature fusion methods for visual recognition, which combines the features after generating the entire set of representations based on different types of features from local regions, the proposed method fuses different features at the stage when the best visual word is selected to represent a local region (hard assignment) or the probabilities of the candidate visual words used to represent the unknown region are estimated (soft assignment). The merits of the proposed method are that (1) errors caused by the ambiguity of single feature when assigning local regions to the best representative visual words can be corrected or the probabilities of the candidate visual words used to represent the region can be estimated more accurately; and that (2) it offers a more flexible way in fusing these features through determining the similarity-metric locally by localized maximum-margin learning. The proposed method has been evaluated experimentally and the results indicate its effectiveness.
منابع مشابه
Large Margin Learning of Upstream Scene Understanding Models
Upstream supervised topic models have been widely used for complicated scene understanding. However, existing maximum likelihood estimation (MLE) schemes can make the prediction model learning independent of latent topic discovery and result in an imbalanced prediction rule for scene classification. This paper presents a joint max-margin and max-likelihood learning method for upstream scene und...
متن کاملDLANet: A manifold-learning-based discriminative feature learning network for scene classification
This paper presents Discriminative Locality Alignment Network (DLANet), a novel manifold-learningbased discriminative learnable feature, for wild scene classification. Based on a convolutional structure, DLANet learns the filters of multiple layers by applying DLA and exploits the block-wise histograms of the binary codes of feature maps to generate the local descriptors. A DLA layer maximizes ...
متن کاملVisual object categorization based on the fusion of region and local features
This paper presents a novel approach for visual object categorization using region based features and statistical measures based image modeling. Our region-based features are extracted from coarse regions obtained by the Gestalt theory inspired region segmentation algorithm and they capture visually significant information such as segments and colors. The modeling of the visual content of an im...
متن کاملAn efficient method for cloud detection based on the feature-level fusion of Landsat-8 OLI spectral bands in deep convolutional neural network
Cloud segmentation is a critical pre-processing step for any multi-spectral satellite image application. In particular, disaster-related applications e.g., flood monitoring or rapid damage mapping, which are highly time and data-critical, require methods that produce accurate cloud masks in a short time while being able to adapt to large variations in the target domain (induced by atmospheric c...
متن کاملOrthogonal Maximum Margin Projection for Face Recognition
Dimensionality reduction techniques that can introduce low-dimensional feature representation with enhanced discriminatory power are of paramount importance in face recognition. In this paper, a novel subspace learning algorithm called orthogonal maximum margin projection(OMMP) is proposed. The OMMP algorithm is based on the maximum margin projection (MMP), which aims at discovering both geomet...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Pattern Recognition
دوره 45 شماره
صفحات -
تاریخ انتشار 2012